Global research hotspots, development trends and prospect discoveries of phase separation in cancer: a decade-long informatics investigation

Liquid-liquid phase separation (LLPS) is a complex and subtle phenomenon whose formation and regulation take essential roles in cancer initiation, growth, progression, invasion, and metastasis. This domain holds a wealth of underutilized unstructured data that needs further excavation for potentially valuable information. Therefore, we retrospectively analyzed the global scientific knowledge in the field over the last decade by using informatics methods (such as hierarchical clustering, regression statistics, hotspot burst, and Walktrap algorithm analysis). Over the past decade, this area enjoyed a favorable development trend (Annual Growth Rate: 34.98%) and global collaboration (International Co-authorship: 27.31%). Through unsupervised hierarchical clustering based on machine learning, the global research hotspots were divided into five dominant research clusters: Cluster 1 (Effects and Mechanisms of Phase Separation in Drug Delivery), Cluster 2 (Phase Separation in Gene Expression Regulation), Cluster 3 (Phase Separation in RNA-Protein Interaction), Cluster 4 (Reference Value of Phase Separation in Neurodegenerative Diseases for Cancer Research), and Cluster 5 (Roles and Mechanisms of Phase Separation). And further time-series analysis revealed that Cluster 5 is the emerging research cluster. In addition, results from the regression curve and hotspot burst analysis point in unison to super-enhancer (a=0.5515, R2=0.6586, p=0.0044) and stress granule (a=0.8000, R2=0.6000, p=0.0085) as the most potential star molecule in this field. More interestingly, the Random-Walk-Strategy-based Walktrap algorithm further revealed that “phase separation, cancer, transcription, super-enhancer, epigenetics”(Relevance Percentage[RP]=100%, Development Percentage[DP]=29.2%), “stress granule, immunotherapy, tumor microenvironment, RNA binding protein”(RP=79.2%, DP=33.3%) and “nanoparticle, apoptosis”(RP=70.8%, DP=25.0%) are closely associated with this field, but are still under-developed and worthy of further exploration. In conclusion, this study profiled the global scientific landscape, discovered a crucial emerging research cluster, identified several pivotal research molecules, and predicted several crucial but still under-developed directions that deserve further research, providing an important reference value for subsequent basic and clinical research of phase separation in cancer. Supplementary Information The online version contains supplementary material available at 10.1186/s40364-024-00587-9.


To the editor,
Complex biochemical reactions and substance metabolism exist within cancer cells, which may lead to uneven distribution of intracellular substances and the formation of liquid-liquid phase separation (LLPS) of different substances, that is, intracellular LLPS phenomenon.Such a LLPS could ultimately affect cancer proliferation, apoptosis, invasion, metastasis, and treatment sensitivity by influencing cell signaling, gene expression modulation, energy metabolism variation, and other mechanisms [1][2][3][4].
With decades of endeavors by oncology biologists and physicists, this field has amassed a wealth of unstructured data and continues to inflate exponentially, rendering it problematic for researchers to make sense of the intrinsic connections and evolutions of this information in a short period.Therefore, utilizing the informatics method, including hierarchical clustering, regression statistics, hotspot burst, and Walktrap algorithm analysis (Additional file 1) [5][6][7][8][9][10], we retrospectively analyzed the scientific knowledge in this field over the past decade, revealed the global research hotspots (GRHs) and development trends, and further identified the critical issues and directions worthy of in-depth exploration.

Discussion
In the last three years, the roles and mechanisms of LLPS in cancer have gradually received deep attention.First, we need to understand how LLPS occurs in cancer.In addition, in some situations, cancer cells develop more complex and diverse structures through LLPS, thus contributing to their survival, recovery, migration, and metastasis [2][3][4].However, notably, LLPS comprises only one fraction of the complex network in cancer, and further exploration of its roles and potential mechanisms in other biological processes (e.g., gene expression, drug response) will contribute to a better understanding and application of such a complex and subtle phenomenon.
Numerous results in this paper point in unison to super-enhancer as the most potential star molecule in this field.Super-enhancers are unique DNA structures that significantly enhance the efficiency of gene transcription, thereby promoting cancer growth and proliferation, but the specific molecular mechanisms by crucial emerging research cluster, identified several pivotal research molecules, and predicted several crucial but still under-developed directions that deserve further research, providing an important reference value for subsequent basic and clinical research of phase separation in cancer.Keywords Liquid-liquid phase separation, Cancer, Super-enhancer, Tumor microenvironment, Immunotherapy, Informatics analysis which they determine cell fate have been unclear [11].Subsequently, Sabari et al. demonstrated that the transcriptional co-activators bind at super-enhancer to isolate transcription-related components from the complex nucleus by LLPS, thereby regulating critical gene expression, providing a novel perspective for our understanding of gene regulation during cell fate determination and disease onset [12].However, the relationship and potential mechanisms of super-enhancer and LLPS, as revealed by the Walktrap algorithm in this study, are still under-developed and need further exploration.The same applies to the interactions between stress granules, tumor microenvironment, and immunotherapy (Additional file 7).A The population of regression-fitted curves based on the frequency of annual occurrence for the themes."a" indicates the slope of the fitted curve."R 2 " indicates the degree of correlation between the two variables.B The burst status and temporal evolution of the themes.C The research prospect discovery for the themes.After performing the Walktrap algorithm based on the random walk strategy, all themes were categorized into four quadrants.Quadrant I is for topics highly relevant to the field and already well-developed.Quadrants II and III are topics of low relevance to the field.Quadrant IV is for topics of high relevance to the field but still under-developed, indicating their research prospect.

Fig. 1
Fig. 1 Spatial and temporal distribution of global research hotspots on phase separation in the field of oncology.A Unsupervised learning hierarchical clustering for global research hotspots.The same colored nodes represent the same cluster.The size of the node indicates the total linkage strength.B Time series analysis of global research hotspots.A darker purple indicates a smaller average publication year, while a darker yellow indicates a bigger average publication year.C Spatial density network based on total linkage strength.D Spatial density network based on occurrence frequency.

Fig. 2
Fig. 2 Regression curve analysis, hotspot burst analysis, and research prospect discovery of research themes regarding phase separation in the field of oncology.A The population of regression-fitted curves based on the frequency of annual occurrence for the themes."a" indicates the slope of the fitted curve."R 2 " indicates the degree of correlation between the two variables.B The burst status and temporal evolution of the themes.C The research prospect discovery for the themes.After performing the Walktrap algorithm based on the random walk strategy, all themes were categorized into four quadrants.Quadrant I is for topics highly relevant to the field and already well-developed.Quadrants II and III are topics of low relevance to the field.Quadrant IV is for topics of high relevance to the field but still under-developed, indicating their research prospect.